Compiling a Task-Based Corpus for the Analysis of Learner Language in Context
نویسندگان
چکیده
Corpora in linguistics and computational linguistics have traditionally been assembled from data sources such as newspaper texts, books and, more recently, the web. While these sources provide large quantities of language data, typically very little or nothing is known about the context under which the text has been produced. The only information an analysis can refer to is the text itself, e.g., when a sentence is analyzed using the preceding sentences for disambiguation. However, language is always produced in a concrete extra-linguistic context. This contextual setting includes world knowledge and situational knowledge, i.e., the aspects of world knowledge which are relevant to interpret the given text and the concrete task and situation that the language was produced for.
منابع مشابه
Metadiscourse Markers in a Corpus of Learner Language: The Case of Iranian EFL Learners
Different issues have been probed in learner corpus research since the late 1980s.However, taking the im- portance of meta discourse markers (MDMs) in signposting academic discourse, their use in Iranian EFL learners‟ academic essays is an area of research in need of a more serious analysis. Contributing to this line of investigation, this paper reports a corpus-based study of the use of MDMs i...
متن کاملHedges in English for Academic Purposes: A Corpus-based study of Iranian EFL learners
Hedges, as tools to express tentativeness and doubt, have been studied in plenty of research papers in the Iranian EFL research setting. However, their use in a learner corpus, portraying Iranian learner English, is in need of more research attention. With this end in view, this study aimed at investigating how Iranian EFL learners who have majored in English-related fields in Iran deployed hed...
متن کاملThe Development and Validation of Language Learner Beliefs Scale in the Iranian EFL Context
Unlike teacher beliefs, there has been a dearth of study regarding EFL learner beliefs. The reason can be that Horwitz (1987) and the existing literature has predominantly been in an ESL context. The present study reports the development and validation of a scale to measure the learner beliefs about language learning in Iranian EFL contexts. Using a combination of verbal creativity method, inte...
متن کاملHow textbooks (and learners) get it wrong: A corpus study of modal auxiliary verbs
Many elements contribute to the relative difficulty in acquiring specific aspects of English as a foreign language (Goldschneider & DeKeyser, 2001). Modal auxiliary verbs (e.g. could, might), are examples of a structure that is difficult for many learners. Not only are they particularly complex semantically, but especially in the Malaysian context ...
متن کاملEvaluation of High School English Course Books in Iran: Task Types in Focus
This study sought to examine the type and frequency of tasks in the Iranian high school English course books (Prospect 1, 2, 3 & English Book 1, 2, 3). The corpus was analyzed based on Nunan’s (1999) framework composed of five main task types, namely cognitive, interpersonal, linguistics, affective, and creative. To this end, the whole content of the aforementioned course books went through con...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009